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(54) Title: SWITCHING FIELD ARRANGEMENT 



(57) Abstract 

The invention 
relates to a switching 
fabric arrangement 
for a packet-switched 
telecommunications 
network. Packets arriving 
at the switching fabric 
are assigned a time stamp 
depending on their time 
of arrival. Time stamp 
information relating to the 
packets to be transported 
is transmitted to at least 
some of the switching 
elements, and in at least 
some of the switching 
elements, the relative order 
of the packets to be sent 
is established on the basis 
of the received time stamp 
information in such a way 
that on each outgoing link 
(IL) from the switching 
element, the packets are in 
the sequence indicated by 
the time stamps. To enable 
the re-sequencing the be 
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implemented simply and further in such a manner that the switching fabric can be easily expanded, a predetermined number of discrete 
values are used as time stamps, and in switching elements in which the relative order of the packets to be sent is established on the basis 
of the time stamps (a) time stamp-related buffers (RSB) are used, (b) incoming packets are stored in the time stamp-related buffers in 
such a way that at least each packet arriving from outside the switching fabric is stored in the buffer corresponding to its time stamp, 
and (c) the time stamp-related buffers are read one buffer at a time to be empty and each packet that is read out is routed to the output 
indicated by the address contained therein. 
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SWITCHING FIELD ARRANGEMENT 



Field of the Invention 

The invention relates generally to switching implemented in a 
5 packet-switched telecommunications network. More particularly, the invention 
relates to a switching fabric arrangement used in such a network, specifically 
an ATM network. 

Background of the invention 

10 To a 'd the understanding of the following description, some terms to 

be used later will first be defined. 

A switching fabric or switching matrix (both terms are used) is 
comprised of a plurality of switching elements which are either identical or 
dissimilar and are interconnected according to a given topology. In the 

15 (English-language) literature of the art, such a switching matrix may also be 
called a "switching network", since the switching elements form a network 
having the given topology. Hence, a switching matrix is considered to have a 
defined form when its switching elements and their interconnections are 
known. 

20 A switching fabric is composed of switching elements by connecting 

a number of switching elements into a network comprising switching elements 
in parallel and in succession. Parallel switching elements make up one 
switching stage. Switching elements in successive switching stages are 
interconnected by internal links in accordance with the above topology. 

25 The term switch is used to denote the entity configured about a 

switching matrix. Hence, a switch can denote any means employed for signal 
switching in a communications network. In the present context, a switch is a 
packet switch as the invention is related to switching in a packet-switched 
telecommunications network, particularly an ATM network. A switch is also 

30 sometimes termed a switching system. 

ATM (Asynchronous Transfer Mode) is a connection-oriented 
packet-switching technique, which has been selected by the international 
organization for telecommunications standardization, ITU-T, as the target 
transfer mode solution for implementing a broadband multimedia network (B- 

35 ISDN). In an ATM network, the problems of conventional packet-switched 
networks (such as X.25 networks) are overcome by transmitting short packets 



BNSDOCID: <WO 9907181A2J. 



WO 99/07181 PCT/FI98/00593 



of a constant length (53 bytes) called cells. Each cell comprises a 48-byte 
payload portion and a 5-byte header. The header comprises, along with other 
data, address information on the basis of which the cell is routed in an ATM 
network. Further discussion of an ATM network herein will be omitted as non- 
5 essential subject to the understanding of the invention. When required, a 
closer description of this topic can be found in international standards and 
textbooks of the art. 

Switches in a conventional TDM network (Time Division Mul- 
tiplexing, aiso called by the name STM, Synchronous Transfer Mode) cannot 

10 be directly implemented to handle the switching in an ATM network. Neither 
are the switching solutions developed for conventional packet networks usually 
suitable as switches for an ATM network. The selection of an optimum ATM 
switching architecture is influenced not only by the fixed cell size and the 
limited functionality of the cell header but also by the statistical behaviour of 

15 the cell stream and the fact that an ATM switch must operate at a very high 
rate (currently typically about 150. ..600 Mbit/s). 

Figure 1 shows schematically an ATM switch seen from the outside. 
The switch has n input lines l v ..l n and m output lines O v ..O m . A cell stream CS 
arrives over each input line to the ATM switch 11. The header of an individual 

20 cell in the data stream is denoted by the reference HD. In the ATM switch, the 
cells are switched from the input line Ij to the output line O js and simultaneously 
the value of the cell header is translated from an incoming value to an 
outgoing value. For this purpose, the switch includes a translation table 12 by 
means of which said header translation is made. It is to be seen from the table 

25 that, for example, all the cells received over line ^ and having a header with a 
value X are switched onto output line O r whereby their header is 
simultaneously given the value K. Cells present on different input lines may 
have headers of equal value; for example, cells received at input line l n with 
the same header value X are also switched onto output line 0 1t but their 

30 header is given the value J on the output line. 

Hence, the main tasks of a switch are: transfer of cells (packets) 
from the input line to the desired output line, and header translation. Occasion- 
ally, as is also evident from the figure, two cells may be simultaneously 
contending for access onto the same output line. For this purpose, the switch 

35 must have buffering capacity to avoid the necessity of discarding cells in such 
a situation. Hence, the third main task of a switch is to provide buffering. The 
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manner in which these three main tasks are performed and in which part of the 
switch the implementation is handled distinguishes different switching solutions 
from one another. 

ATM switching fabrics, on the other hand, can be subdivided into 
5 two classes depending on whether the switching elements used in the fabric 
are buffered or unbuffered. 

In a fabric using unbuffered switching elements, routing of cells 
through the fabric operates at the cell level in such a way that a route is 
separately selected for each cell irrespective of which virtual channel the cell 

10 belongs to. In simplified rendition, proceeding of cells through the fabric may 
be thought of as comprising two steps. In the first step, the cells are sent from 
the input ports through the fabric and in the second step either the switching 
elements or the output ports give an indication to the input port which cells 
were successful in traversing the fabric. The latter step must be performed, 

15 since if the cells possibly attempt to use the same internal link of the fabric, all 
simultaneous cells except one must be discarded since the switching elements 
have no buffers. The problem with such a switching fabric is that the operation 
of all its elements must be mutually synchronized, i.e., all elements and input 
and output ports must be in the above-described steps at the same time. Since 

20 cell transmission is very rapid and in practice even several further steps may 
be necessary besides the two described above, synchronization cannot be 
achieved merely by means of handshaking signals between the elements, but 
all elements must be synchronized from a common clock source, and it is 
difficult to distribute the clock signal to all elements if the switching fabric is 

25 very broad and it has been necessary to divide it among several plug-in units. 
It may also be noted that even a switch provided with non-buffered switching 
elements must have buffering capacity either in the input or in the output ports 
of the fabric to avoid the need of discarding cells contending simultaneously for 
the same output line of the switch. 

30 A switching fabric comprised of buffered switching elements does 

not have the above synchronizing requirement. The selection of the route of 
the cells through the switching fabric can be virtual channel-related. For this 
purpose, however, a record of the load on the internal links of the switching 
fabric must usually be kept. A buffered switching fabric is usually blocking at 

35 the connection level, as to construct it to be non-blocking usually requires so 
much extra capacity that this is no longer economically feasible. In a blocking 
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fabric, the selection of the route for the connection is a highly critical factor 
when it is attempted to reduce the blocking. 

On account of the foregoing, many manufacturers have sought a 
solution from a method in which the route is selected at the cell level also in a 
5 switching fabric provided with buffered switching elements. To avoid the 
necessity of keeping a record of the load on the internal links of the fabric and 
to enable even distribution of the load, the cell route is usually selected at 
random. (The selection may also be done in accordance with a suitable non- 
random algorithm.) In that case, the switching fabric can be constructed to be 

10 non-blocking at reasonable cost. 

Since the route of the cells varies in such random routing, it is 
possible that a cell sent later travels faster through the switching fabric and 
overtakes a cell sent earlier. This is due to the fact that the load on the 
switching elements and thereby also the fill rates of their buffers vary 

15 constantly, and hence also the dwell time of the cell in the fabric varies at 
different times and over different routes. Restoring the order of the cells neces- 
sitates special solutions known as re-sequencing. 

In principle, there are two basic solutions for cell re-sequencing de- 
pending on which part of the switching fabric the re-sequencing is performed 

20 in. The cells can be re-sequenced either after the switching fabric in re- 
sequencing (micro)circuits provided for this purpose (alternative 1), or the re- 
sequencing can be performed already within the switching fabric between its 
switching stages (alternative 2). 

There are several variations of the first alternative; one embodiment 

25 is disclosed in U.S. Patent 5,481 ,536. 

The factor common to the above solutions is that the transit time 
used by the cell in traversing the switching fabric (alternative 1) or to the 
switching elements (alternative 2) is measured for example by using a time 
stamp, and thereafter the cell is delayed for a time sufficient for a predet- 

30 ermined maximum time limit to be exceeded. This ensures that the delay of all 
cells through the switching fabric remains the same. The drawback of these 
solutions, however, is that they require a very complex circuit using parallel 
processing at the output ports of the switching fabric (alternative 1) or in the 
switching elements (alternative 2). 

35 U.S. Patent 5,337,308 discloses a solution relating to alternative 2, 

in which a time stamp dependent on the time of arrival is attached to each cell 
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arriving at the switching fabric. The time stamps of the cells at the head of the 
input buffers are monitored within the switching fabric in a discrete switching 
element, in order to find the minimum value of these time stamps. When one 
of the output buffers is empty, an idle cell is generated in lieu thereof, the time 
5 stamp of which is given said minimum value. By means of these idle cells, the 
switching element can provide time information to a switching element in the 
next switching stage, and on the basis of this information the element in the 
next stage is able to order, with minimum delay, the cells (packets) in the 
element in the correct sequence and to forward them in the correct order. 

10 The principal drawback of the solution disclosed in this U.S. Patent 

also resides in the heavy comparison operations required by the re-se- 
quencing of the cells. When an input buffer receives a cell, the time stamp 
carried by said cell is compared with the time stamps of the cells at the head of 
ail other input buffers. On the basis of the comparison, the cell with the 

15 smallest time stamp is found, as a result of which a transfer command to 
transfer the cell out from the buffer is given to said buffer. Hence, N 
comparisons must be performed per each cell to be transferred, N being the 
number of links to and from the switching element. 

A re-sequencing solution differing from the one described above is 

20 disclosed in U.S. Patent 5,485,457. This solution makes use of the three-stage 
structure of the switching fabric, in which case the cells can still be relatively 
easily ordered in the third stage, even though the solution requires compu- 
tationally heavy comparison operations in the switching elements of the third 
stage to find the smallest time stamp in each case. In this solution, the first 

25 switching stage is unbuffered and an empty cell indicating the emptying of a 
given buffer in the second switching stage is sent from said second switching 
stage to the third switching stage to reduce the switching delay. 

The drawback of this solution, in addition to the heavy comparison 
process it requires, is the fact that it does not permit flexible expansion of the 

30 switching fabric (flexible increasing of the number of switching elements and/or 
switching stages). 

Summary of the Invention 

It is an object of the present invention to eliminate the drawbacks 
35 described above and to provide a method wherewith the re-sequencing of 
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packets can be implemented more simply than heretofore and also in such a 
manner that the switching fabric can be easily expanded. 

This object is achieved with a solution as defined in the independent 

claim. 

5 The idea of the invention is to utilize as time stamps a 

predetermined number of discrete values, and to use time stamp-related 
buffers into which the packets are stored in such a way that each incoming 
packet is stored in the buffer specific to its time stamp. The time stamp-related 
buffers are emptied by reading one buffer at a time. 

10 To minimize the delay, it is advantageous to use in connection with 

such a re-sequencing arrangement a basic solution of the kind described 
above, in which a time stamp dependent on the time of arrival is attached to 
each incoming packet (or cell), and by means of padding packets internal to 
the switching fabric, time stamp information that is substantially continuous 

15 and is formed on the basis of the time stamp information arriving at the 
switching element is transmitted from one switching element to another, 
independently of the arrival of data packets. 

In accordance with another preferred embodiment of the invention, 
packets are read out from the input buffers of the switching element into time 

20 stamp-specific buffers, simultaneously maintaining information on the smallest 
time stamp the packet corresponding to which has been read out from the 
input buffers at each time. The packets are read out from the time stamp- 
specific buffers utilizing this value in such a way that in each case, a time 
stamp-specific buffer whose specific time stamp is smaller than or equal to the 

25 above smallest time stamp, is read. 

The solution of the invention offers a clear and simple re-sequencing 
method and does not require high parallelism (complexity) from the circuit 
implementation. All switching elements can have similar implementation, and 
thus the fabric can be easily enlarged almost without limit, as long as it is 

30 ensured that the maximum delay of the cells will not become too long. 

Moreover, the solution in accordance with the invention does not 
require high buffering capacity in the switching element, for example the input 
buffers can comprise only one cell. Furthermore, by means of the solution the 
switching delay occasioned by the switching fabric will be made small, since 

35 the time information enabling the re-sequencing can be rapidly transmitted 
from one switching element to another. 
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List of Drawings 

The invention and its preferred embodiments will be described more 
closely in the following with reference to examples in accordance with Figures 
5 2-76 in the accompanying drawings, in which 

Figure 1 depicts the basic principle of an ATM switch, 

Figure 2 depicts a switching fabric in accordance with the invention on a 
general level, 

Figure 3 depicts one switching fabric in accordance with the invention, 
10 Figure 4 illustrates time stamping of cells performed at the edge of the 
switching fabric, 

Figure 5 shows the structure of a cell traversing in the switching fabric, 
Figure 6a shows an individual switching element in accordance with the 
invention, 

15 Figure 6b shows a cell queue to be sent to an internal link of the switching 
fabric, and 

Figures 7a... 7d illustrate the operation of the re-sequencing unit included in 
the switching element. 

20 Detailed Description of the Invention 

To elucidate the solution in accordance with the invention, the 
following will first give a description of a switch in accordance with the 
invention on a general level. At this level, the solutions are still known as such. 

The switch can be functionally divided into three parts as shown in 

25 Figure 2: a time stamping unit TSU, a distribution section RRS, and a routing 
section RS. Only the last two of these contain switching elements, and hence 
they form the actual switching fabric (or switching matrix). 

. Ahead of the switching fabric, a time stamping unit TSU for cells is 
provided, which comprises one time stamping unit AL, for each input line I, 

30 (i=1...n). These time stamping units stamp the same time on cells arriving 
substantially simultaneously at the switching fabric. The distribution section 
RRS routes the cells at random or in accordance with a given predetermined 
algorithm in such a way that cells arriving at a given input line are distributed to 
preferably all output lines of the distribution section as evenly as possible. This 

35 enables the cell stream to be evenly distributed among all available paths, 
which will again diminish the internal blocking of the switch. The routing 
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section RS for its part routes the cell to the correct output port on the basis of 
the routing tag contained in the cell. Hence, the routing section is such that all 
cells with a given output port address are routed to said output port 
irrespective of which input port of the routing section they are sent to from the 
5 distribution section. 

In the switch in accordance with the invention, the path traversed by 
the cell is thus comprised of a first part selected at random or in accordance 
with a given algorithm and a second part that is selected on the basis of the 
routing information in such a way that the cell is directed to the correct output 

10 port. Such a structure is previously known. 

The type or detailed structure of the switching fabric is not essential 
to the invention, since the method of the invention can be applied to a wide 
variety of switching fabric types, such as Benes, Clos or Delta-type switching 
fabrics. What is essential in the switching fabric is that it comprises a 

15 distribution section from each individual input line (l j5 j=1 ...n) of which routes to 
preferably all output lines (RO v ..RO k ) of the distribution section can be 
established, and that it comprises a routing section in which at least one route 
from each input line (RO it i=1...k) to the desired output line (O,) (according to 
the routing information) can be established. The relative order of these parts 

20 and their number is not essential to the invention. 

Figure 3 shows a switching matrix in accordance with the invention, 
which in this example is a three-stage Clos-type switch in which both the first 
stage and the third stage have k switching elements SE (numbered from one 
to k). The switching elements in the first stage are denoted by reference SE(1), 

25 the switching elements in the second stage by reference SE(2), and the 
switching elements in the third stage with reference SE(3). Each switching 
element in the first stage has n' inputs and m outputs. Each switching element 
in the third stage for its part has m inputs and n' outputs. The middle stage has 
a total of m switching elements, each having k inputs and k outputs. The 

30 switching elements in the first stage are connected to the second stage in such 
a way that each of the outputs of the switching element is connected to a 
different switching element in the second stage. Furthermore, the switching 
elements in the first switching stage are connected in succession to different 
inputs of the switching element in the second stage. More precisely, the j:th 

35 output of the i:th switching element in the first stage is connected to the i:th 
input of the j:th switching element in the second stage. The second and third 
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stage are interconnected in a corresponding manner, in other words, the j:th 
output of the i:th switching element in the second stage is connected to the i:th 
input of the j:th switching element in the third stage. 

In the exemplary case of Figure 3, the first switching stage serves as 
5 a distribution section (RRS) and the two next switching stages as a routing 
section (RS). 

Irrespective of what type of switching fabric is used, each switching 
element SE(i) of the switch in accordance with the invention comprises 
functionally two parts: a cell re-sequencing unit UJ and a cell switching unit 

10 KY. These units need not necessarily be units clearly distinct from one another 
and the remaining logic, but it suffices that the respective functions are present 
in the switching element. The cell re-sequencing unit UJ orders the cells in the 
correct sequence if they arrive in the incorrect order along different routes. The 
re-sequencing will be described more closely hereinafter. On account of the re- 

15 sequencing performed in the switching element, the cells pass in each internal 
link (one of which is denoted by reference IL in Figure 3) in the order indicated 
by the time stamps, even though the queue may comprise several cells 
provided with the same time stamp in succession. This feature makes the re- 
sequencing remarkably simple, as will be seen below. 

20 Figure 4 illustrates the time stamping performed by the time stamp- 

ing units. The task of the time stamping unit TSU is, as stated previously, to 
provide all cells arriving at the switching fabric at substantially the same time 
with the same time stamp. Cells arrive in succession at each input line in such 
a way that the next cell starts with the rear border of the preceding cell (since 

25 the cells are in practice transported for example in the SDH system, 
Synchronous Digital Hierarchy, in which idle time slots are filled with "idle" 
cells). The time axis is divided into successive time slots TS in accordance with 
the transmission time of the cells, and the time stamp is incremented from one 
time slot to another. In each time slot, the arriving cell receives a time stamp 

30 corresponding to that time slot. In accordance with the invention, a predeter- 
mined number of discrete values, for example integers obtained from a modulo 
counter, are used as a time stamp in such a way that the counter counts time 
slots from zero to N, whereafter it again starts from zero, etc. Hence, all time 
stamping units must be approximately in the same phase, although they need 

35 not be precisely in the same phase. The figure denotes by reference AT the 
phase difference that the different time stamping units can have. In practice, 
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the best alternative is that this phase difference is one cell at most, but the 
method of the invention will be feasible even though the phase difference is 
greater, yet the delay of the cells may become inordinately long in such a 
case. In the figure, the time stamps have values 0...N. 
5 In the time stamping units, the cells also receive a routing tag 

internal to the switch and a field that may indicate whether a padding cell 
internal to the switch or an ATM cell to be transferred to the network is 
involved. Hence, each incoming cell to the switch is as shown in Figure 5. As 
is known, an actual ATM cell comprises a payload of 48 bytes and a header of 

10 5 bytes. Preferably three identifier fields, each of which comprises an identifier 
internal to the switch that is removed at the output of the switch, are inserted 
into this ATM cell. The header field need not necessarily be transported 
through the switching fabric, but it can be removed at the input port of the 
switching matrix and a header field provided with new information inserted at 

15 the output port of the switching matrix. The routing tag field Rl comprises a 
routing tag internal to the switch, wherewith the cell is routed in the switch 
(routing section) in the known manner to the correct output port. The time 
stamp field TS comprises a time stamp inserted by the time stamping unit, 
which is preferably an integer that is obtained from the modulo counter of the 

20 time stamping unit. The type field TF comprises information as to whether the 
cell is a padding cell internal to the switch or some other cell (i.e. a cell to be 
transmitted to the network). It is to be noted that padding cells internal to the 
switch are different from the idle cells that are transported outside the 
switching fabric (in the network). Padding cells can be generated as late as in 

25 the switching fabric, or they can be generated already in the time stamping 
units to replace idle cells arriving from the network, or the generation can be 
performed both in the time stamping units and within the switching fabric. 

The cell distribution performed by the distribution section can be 
implemented for example in such a manner that those bits in the routing tag 

30 which correspond to the switching stages in the distribution section are formed 
randomly or in accordance with a given algorithm, and hence the cells are 
routed in the distribution section at random or in accordance with said 
algorithm. The entire routing tag can be formed at the input port of the 
switching matrix, or the part corresponding to the random distribution stages 

35 can be left unformed, in which case said stages perform the random 
distribution independently. 



BNSDOCID: <WO 9907181A2_I_> 



WO 99/07181 



11 



PCT/FI98/00593 



After the above description of the basic structure of the switching 
fabric, the following will describe the solution in accordance with a preferred 
embodiment of the invention in detail. 

Figure 6a illustrates a possible structure of an individual switching 
5 element SE in the switching fabric. The input of the switching element 
comprises an input buffer unit IB, having n parallel FIFO-type input buffers IB, 
(i=1...n), one for each input line. The re-sequencing unit UJ comprises a cell 
storage unit SM, a control unit CU controlling writing into the storage unit and 
reading out from the storage unit, and a padding cell memory CM in which a 

10 padding cell is stored. The input buffers are connected to the storage unit SM 
via an internal bus B1. The re-sequenced cells are transmitted from the 
storage unit to the switching unit KY, which is constituted by a bus B2 via 
which the cells are transferred, and parallel filter units FU; (1=1... m). An output 
buffer unit OB is incorporated after the filter units, comprising parallel output 

15 buffers OB, (i=1...m) of FIFO type. One filter unit and one output buffer is 
provided for each output line OL t (i=1...m). A corresponding output buffer is 
connected after each filter unit, and a corresponding reading unit RU| (i=1...m) 
reading cells onto the corresponding output line (i.e. internal link of the switch) 
is connected after each output buffer respectively. 

20 Hence, the switching unit KY may be for instance a conventional 

switching unit implemented with a shared bus and provided with filters. The 
switching units operate in two modes: on the first side of the switching fabric 
they operate as random distributors of cells, and on the second side of the 
switching fabric as routing means routing the cells to the correct output port. 

25 For example in the case of Figure 3, the switching units in the first stage 
operate as random distributors and the switching units of the second and third 
stage as routing means. 

For multicast transmissions, the switching unit comprises m parallel 
filtering units. If there was no need for multicast transmissions, one common 

30 filtering unit that would route the cell to the correct output queue would suffice 
in the switching unit. 

In order to make the re-sequencing of cells as easy as possible, 
also the padding cells traversing the switching fabric that are used between 
the switching elements to maintain the internal links of the switching fabric in 

35 cell delineation synchronization are provided with time stamps that are formed 
on the basis of the time stamps of cells arriving at the switching element. Also 
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other cells that are internal to the switching fabric and distinct from ordinary 
ATM cells shall be similarly provided with a time stamp. In some switching 
fabrics, such other internal "padding cells" might be for example fabric control 
cells to be sent to the switching elements. Cell delineation synchronization in 
5 this context means that cells are sent continuously so that there are no "gaps" 
between the cells in which no cells would pass. In other words, the transmitter 
and receiver are synchronized in such a way that no information on cell 
delineation need to be sent separately. 

The control unit CU, which controls the reading from the input 

10 buffers \B t and re-sequencing of cells, also keeps track of which is the smallest 
time stamp carried by the cells currently present in the storage unit SM (or in 
cells read from the input buffers if all cells are not stored in the re-sequencing 
unit). The padding cell memory CM stores a padding cell internal to the 
switching fabric as shown in Figure 5, and the control unit updates the time 

15 stamp of this cell with the value that is currently the smallest time stamp 
contained in the cells to be forwarded from the storage unit SM. Each reading 
unit RUi reads cells at the same constant rate onto the respective link. If an 
output buffer is empty, the pertinent reading unit copies one or more cells from 
the padding cell memory onto the outgoing internal link to maintain the cell 

20 synchronization. Thus, even if no actual data cells arrived at the output port for 
a considerable time (in which case the output buffer is emptied), padding cells 
provided with a time stamp nevertheless exit from that output port. 

Hence, a continuous cell queue in accordance with Figure 6b is sent 
from the switching unit to each internal link (OLj), in which queue each cell 

25 carries a time stamp regardless of whether a padding cell internal to the fabric 
(the value of the type field TF of which is denoted by one) or some other cell, 
such as a normal data cell (the value of the type field of which is denoted by 
zero), is concerned. 

In a conventional prior art switching fabric, padding cells are not 

30 transported through the switching element, as the padding cells can be 
terminated at the input of the switching element and regenerated at the output 
of the switching element since the padding cells are similar. On the other hand, 
in a switching fabric operating in accordance with the invention the padding 
cells are transported up to the re-sequencing unit, or at least a time stamp is 

35 stored from the padding cells for re-sequencing. 
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Since the padding cells in the switching fabric of the invention are 
similar except for the time stamp, it suffices that only the time stamps thereof 
are carried over as information to the re-sequencing unit. Neither are the 
padding cells sent through the switching unit, but a padding cell provided with 
5 the correct time stamp is generated at the output port of the switching element 
by reading the padding cell provided with the correct time stamp out from the 
padding cell memory CM. Only data cells traverse the switching unit KY. 

Each (data) cell read out from the re-sequencing unit is transferred 
via bus B2 to each filtering unit. The filtering units read from the cell routing tag 
10 the bits corresponding to the pertinent switching stage, and the filter units 
whose output port corresponds to the address indicated by the bits pass the 
cell and the other filtering units discard the cell. Instead of the routing tag, an 
address indicating the output port may also be used as routing information in a 
known manner. 

15 Figures 7a... 7d illustrate in detail the operational principle of the re- 

sequencing unit UJ, showing in the different figures the cell reading and writing 
steps by way of example. The storage unit SM comprises N+1 FIFO-type 
buffers RSB, that is, one buffer for each time stamp. The figures show beside 
each buffer the time stamp that the buffer corresponds to. Furthermore, the 

20 figures show input buffers IB^.IB, from which the cells are read out. Each cell 
in the input buffers is denoted by a number corresponding to the value of the 
time stamp carried by said cell. The cells are read out from the input buffers 
into the storage unit by going through the input queues in succession and 
reading the cell at the head of each input queue. If the cell is a data cell, it is 

25 written via bus B1 into the storage unit. The control logic stores the time 
stamps of padding cells but does not write the content of these cells into the 
storage unit. To elucidate the writing and reading mechanism of the buffers 
RSB, the figures only show the re-sequencing of those cells which are present 
at the input buffers in the initial situation shown in Figure 7a. 

30 The control unit CU has two counters, counter X and counter Y, 

which are implemented for example in the form of two registers. The control 
logic CL, controlling the reading and writing, increments the counters and 
utilizes them in the relative phasing of the read and write operations. The 
writing takes place during each reading cycle of the input queues into a given 

35 group of time stamp buffers, and said group changes as the time stamps read 
out from the input queues change. The phasing illustrates from which time 
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stamp buffer the reading out takes place simultaneously as writing into a given 
group of buffers is effected. 

Counter X indicates the smallest time stamp the cell according to 
which was read out from the input queues in conjunction with each reading 
5 cycle (during one reading cycle, a cell is read out from the head of each 
queue, by performing the reading one queue at a time). On account of the 
revolution of the counter, the smallest time stamp is found by comparing the 
read time stamp values with the current value of counter X, for example in 
such a way that the current value of counter X is subtracted from the read time 

10 stamp value, in which case the result modulo(N+1) defines the "smallness", i.e. 
the time distance of the read time stamp value from the current value of X 
measured in the augmenting direction. The smallest time stamp in a reading 
cycle thus signifies the time stamp whose time distance to the current value of 
counter X (i.e. to the smallest time stamp of the previous cycle) is the smallest. 

15 Since in each switching element the cells bound for an individual 

output line are ordered in the correct sequence, the cells in each input queue 
are in the order indicated by the time stamps. Thus, the value of counter X 
need not be decremented, except when the time stamp counter wraps (from N 
to zero). The value of counter Y, on the other hand, indicates the queue of the 

20 re-sequencing unit (time stamp buffer) wherefrom the cells are read out into 
the switching unit. The reading takes place by emptying one buffer (RSB) at a 
time. Each time a queue is emptied, the value of counter Y can be 
incremented by one and the method can proceed to emptying the next queue. 
The control logic CL only need to attend to the fact that the value of counter Y 

25 never exceeds that of counter X. Hence, for counter values Y<X always 
applies. This condition can be maintained in such a way that if the value of 
counter Y tends to become greater than that of counter X, one awaits the 
increasing of the value of counter X to increment the value of counter Y. If 
Y=X, the writing takes place into the same queue that the cells are read out 

30 from. Reading out from the storage means and writing into the storage means 
can be implemented fully independently of each other, as long as said 
condition is maintained. 

In the initial situation of Figure 7a, each counter is presumed to have 
the value zero. 

35 After the first reading cycle of the input buffers, the situation is as 

shown in Figure 7b, that is, the cell at the head of each input buffer has been 
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transferred into the buffer RSB corresponding to the time stamp. Since the 
smallest time stamp that was read out from the input buffers was zero, the 
value of each buffer still remains zero. At this stage, reading out from the 
buffer corresponding to time stamp zero can be commenced. 
5 Figure 7c shows the situation after the second reading cycle of the 

input buffers. Since the smallest time stamp that was read out from the input 
buffers in this cycle had the value one, the control logic increments counter X 
by one. In this example, the value of counter Y remains one integer smaller 
than that of counter X, and hence the value of counter Y does not change yet. 

10 Thus, at this stage the reading of the buffers is still at the buffer corresponding 
to time stamp zero. The cells already read out are checked off. 

In the situation shown in Figure 7d, the third reading cycle of the 
input buffers has been carried out, as a result of which counter X has the value 
2. The buffer corresponding to time stamp zero has been emptied and the 

15 value of counter Y has increased to one, and hence reading takes place from 
the buffer corresponding to time stamp one, into which cells are no longer 
introduced in the course of this time stamp cycle on account of the value of 
counter X. 

In the example of Figures 7a... 7d, to illustrate the cell sequencing 

20 principle all cells are shown as being transferred into the time stamp buffers. In 
practice, however, it is preferable to proceed in such a way that each time it is 
found in connection with reading the input queues that the cell is a padding 
cell, only the time stamp thereof is stored and it is studied, similarly as with 
data cells, whether said time stamp is the smallest in the reading cycle. The 

25 content of the padding cell is not written into the time stamp buffers. 
Alternatively, all cells can be written into the time stamp buffers but the cells 
are not read via bus B2 into the switching unit if it is detected from the identifier 
field TF that they are padding cells within the fabric. The last-mentioned 
alternative is more disadvantageous in that it requires higher buffering capacity 

30 in the re-sequencing unit. 

The value of counter Y can be directly taken as the time stamp by 
means of which the time stamp of the padding cell present in the padding cell 
memory CM is maintained. If one output queue in the switching unit is empty 
and one or more padding cells are read onto said output line from the padding 

35 cell memory, one can be certain that the buffers RSB do not contain cells that 
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would have a smaller time stamp than the value of counter Y. Hence, the 
position of the padding cell in the queue will be as indicated by the time stamp. 

In the solution described above, each input queue (IB;) in the 
switching element can have the length of one cell only if reading out from the 
5 input queue takes place in time with the arrival of cells. 

The length of the queues (buffers) in the re-sequencing unit must be 
at least equal to the number of input ports in the switching element. On the 
other hand, the length need not be very much greater than this, since it is 
highly improbable that a large number of data cells provided with the same 

10 time stamp should arrive at the same switching element. The maximum delay 
of cells through the switching fabric expressed as cells is a sufficient upper 
limit for the number of queues (N+1), but a feasible value may in practice be 
much smaller than this. 

Even though the invention has been explained in the foregoing with 

1 5 reference to the examples in accordance with the accompanying drawings, it is 
evident that the invention is not so restricted, but it can be varied within the 
scope of the inventive idea set forth in the appended claims. The following is a 
brief description of possible variations. 

Even though the preferable method is to use padding cells to carry 

20 time information, it is to be noted that the use of the re-sequencing principle of 
the invention does not necessarily require time stamps to be transported in 
padding cells (i.e., padding cells are not indispensable, but it suffices that the 
time stamps are transported in data cells), and thus the padding cell memory 
CM is not in use either. In that case, it must only be awaited, if one input buffer 

25 in the switching element is emptied, until more data cells arrive thereat before 
the value of counter X can be incremented. There will be an increase in 
delays, but the method of the invention nevertheless works. 

One way of preventing the emptying of the input buffers of the 
switching element (or at least diminishing the likelihood of their being empty) 

30 and thereby of reducing the delay, if it is not wished that the switching element 
generates padding cells provided with time stamps, is to send from the input 
ports of the switching fabric time-stamped padding cells, either randomly to 
some output ports or as multicast transmissions to all output ports. The 
switching elements can transmit these padding cells further if the buffers of the 

35 elements are empty or nearly empty. Otherwise they can discard the padding 
cells in the normal way. 
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In principle, it is also possible to transmit the time stamp related to 
the packet (cell) by another route, for example through a parallel link on which 
the packets are not transported. In that case, the re-sequencing is otherwise 
as explained above, but the time stamp related to the packet is not read from 
5 the packet but from a separate memory location at which the time stamp 
information is stored. The time stamp is thus physically or logically attached to 
the packet. It is also possible that a continuous cell queue is not transmitted on 
the internal link, but still continuous time stamp information is transmitted 
through a parallel link, for example, similarly as was done with padding cells 

10 above. In other words, information on the smallest time stamp read out from 
the input queues continuously passes for example through a parallel link to the 
next switching unit, similarly as it passes in the padding cells. 

Part of the padding cells can also be left without a time stamp, or 
they can be provided with an identifier indicating that the time stamp is the 

1 5 same as in the previous cell. The time stamps can thus also be transmitted as 
information indicating the change of the time stamp value relative to the 
previous time stamp value. 

Also the hardware configuration may vary in many ways. For 
example, the switching element can be implemented in a variety of ways, such 

20 as a space or time divisioned switching element, an input or output buffered 
switching element, or a shared buffer or shared bus switching element. Also 
the switch part of the switching element can be implemented in many ways 
known per se. In addition to the fact that the switch part can be a bus-type 
switch part as described above, in which the cells are transferred via a bus in a 

25 time divisioned manner one cell at a time, the switch part can be implemented 
for example as a space divisioned full mesh switch part with an arbiter at each 
output port, which sees all cells bound for that output port and selects one of 
them according to a given rule and prevents the other cells from being 
transmitted. Also the buffering of the switching element can be realized by 

30 known methods either as input or output buffering or as a combination of 
these. The cells can also be stored in the shared memory of the switching 
element and only the addresses of the cells (referring to the shared memory) 
transferred in the queue. A cell queue, as the other terms used in this connec- 
tion, is to be so construed that the practical implementation of the apparatus 

35 may vary as long as the implementation is logically or functionally analogous to 
the above. Also, if the structure of the switching fabric is such that one or more 



BNSDOCID: <WO_ 99071 81 A2_l_> 



WO 99/07181 PCT/FI98/00593 

18 



switching stages in which the order of the cells does not change can be added 
thereto, there is no need to carry out re-sequencing in all switching elements of 
the switching fabric. The padding cells can also be extracted from the input 
queues of the switching element for processing and stored in separate input 
5 queue-specific memory locations, for instance. When an input buffer becomes 
empty, the time stamps of the padding cells stored in the separate memory 
area can be used for the operation of the control unit in accordance with the 
method described above. 



BNSDOCID: <WO 9907181A2J_> 



WO 99/07181 PCT/F198/00593 

19 



Claims: 

1. A switching fabric arrangement for a packet-switched tele- 
communications network, through which switching fabric arrangement packets 
are sent and which switching fabric arrangement comprises 
5 - several input ports and several output ports, 

- several switching elements (SE), each comprising several inputs and 
several outputs, said switching elements being arranged in succession to form 
several successive switching stages and being interconnected with internal 
links (IL) leading from one switching stage to another, a connection being 

0 established through the switching elements and internal links from an input 
port to the desired output port, 

in accordance with which switching fabric arrangement 

- packets arriving at the switching fabric are assigned a time stamp 
depending on their time of arrival, 

5 - each switching element routes a packet arriving at its input to one or 

several of its outputs on the basis of routing information (Rl) associated with 
the packet, 

- time stamp information relating to the packets to be transported is 
transmitted to at least some of the switching elements, and 

0 - in at least some of the switching elements, the relative order of the 

packets to be sent is established on the basis of the received time stamp 
information in such a way that on each outgoing link (IL) from the switching 
element, the packets are in the order indicated by the time stamps, 
characterized in that 
5 a predetermined number of discrete values are used as time stamps, 

and in switching elements in which the relative order of the packets to be sent 
is established on the basis of the time stamps: 

.- time stamp-related buffers (RSB) are used, 

- incoming packets are stored in the time stamp-related buffers in such 
0 a way that at least each packet arriving from outside the switching fabric is 

stored in the buffer corresponding to its time stamp, and 

- the time stamp-related buffers are emptied by reading one buffer at a 
time, and each packet read out is routed to the output indicated by the address 
contained therein. 

5 2. A switching fabric arrangement as claimed in claim 1, 

characterized in that the time stamp information is transmitted to the 
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switching elements in such a way that from at least some of the switching 
elements, time stamp information is sent to the switching elements of the next 
switching stage by forming the time stamp information to be forwarded from 
the switching element in each case on the basis of the time stamp information 
5 received in said switching element. 

3. A switching fabric arrangement as claimed in claim 2, charac- 
terized in that time stamp information is transmitted substantially continu- 
ously from the switching elements to the switching elements of the next 
switching stage. 

10 4. A switching fabric arrangement as claimed in claim 3, charac- 

terized in that padding packets internal to the fabric, containing time stamp 
information, are transmitted within the switching fabric. 

5. A switching fabric arrangement as claimed in claim 1 , charac- 
terized in that padding packets internal to the fabric, containing time stamp 

15 information with values that are kept the same within the fabric, are transmitted 
from the input ports of the switching fabric. 

6. A switching fabric arrangement as claimed in claim 4, in which the 
packets arriving at the switching element are stored in an input buffer (IBj) 
corresponding to the input, characterized in that in switching elements 

20 in which the relative order of the packets to be sent is established on the basis 
of the time stamps: 

- packets are read out from the input buffers, and at least some of the 
packets read out are transferred from the input buffers to the time stamp- 
related buffers, 

25 - information is simultaneously maintained on the smallest time stamp 

(X) the packet corresponding to which has been read out from the input buffers 
in each case, and 

- packets are read out in each case from a time stamp-related buffer the 
time stamp corresponding to which is smaller than or equal to said smallest 

30 time stamp (X). 

7. A switching fabric arrangement as claimed in claim 6, in which 

-the routed packets are stored in the output buffer (OBj) corresponding 
to their routing address, and 

-packets are read out from the output buffer onto the internal link 
35 corresponding to the buffer, 

characterized in that 
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- a padding packet is stored in the memory (CM) of the switching 
element, for which packet a time stamp is continuously maintained that is the 
smallest time stamp (Y) the packet corresponding to which is read out in each 
case from the time stamp-related buffers, and 

5 - when the output buffer is empty, at least one padding packet is read 

out from said memory (CM) to form a continuous packet queue onto said 
internal link. 

8. A switching fabric arrangement as claimed in claim 6, charac- 
terized in that 

10 - when packets are read out from the input buffers, the time stamp is 

stored from each packet read out, and 

- only the packets that have arrived from outside the switching fabric are 
transferred to the time stamp-related buffers. 

9. A switching fabric arrangement as claimed in claim 6, charac- 
15 t e r i z e d in that in switching elements in which the relative order of the 

packets to be sent is established on the basis of time stamps, packets are 
continuously read out from the time stamp-related buffer the time stamp 
corresponding to which is equal to said smallest time stamp (X). 
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